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["t I ■ We examine regular and irregular repeat-accumulate (RA) codes with repetition degrees which are all even. For 

fS| ' these codes and with a particular choice of an interleaver, we give an upper bound on the decoding error probability 

■ of a linear-programming based decoder which is an inverse polynomial in the block length. Our bound is valid for 

any memoryless, binary-input, output-symmetric (MBIOS) channel. This result generalizes the bound derived by 
Feldman et al., which was for regular RA(2) codes. 



C/3 



> 
(N 



Keywords: Coding theory, repeat-accumulate codes, linear-programming (LP) decoding, upper bound, 
error performance. 

I. Introduction 



' Since the discovery ifTOll of Turbo codes in 1993, there has been much focus on understanding why 

^ ' they perform superbly as they do. The discovery of Turbo codes also sparked an abundance of research 

into LDPC codes which were originally discovered by Gallager ifTTI . This vast study of Turbo and LDPC 
j_j ■ codes, as well as their many variations, has mainly been with respect to two types of decoders: Optimal 

maximum-likelihood (ML) and sub-optimal iterative message-passing algorithms. The latter have been 
extensively researched with several variations of the decoding algorithm, producing in some cases an 
accurate understanding of the decoder performance. 

Recently, a novel decoding scheme based on linear programming (LP) was proposed. Initially, an 
LP-based decoder was proposed for Turbo codes by Feldman et al |[13i] with an explicit performance 
bound given for repeat-accumulate (RA) codes, a variant of Turbo codes. Later, another LP-based decoder 
was proposed for LDPC codes by the same authors |[T2l . These results, among others, have been well- 
summarized in m. Further results for the LP decoder of LDPC codes include the characterization of 
pseudocodewords, and in particular, minimum-weight pseudocodewords (e.g., HI, 191); results for the 
binary symmetric channel (BSC) on the error-correction capability (e.g., ||5l, |[T6l ). and others. 



One interesting property of the LP decoder is the ML certificate property. That is, that whenever the 
LP decoder outputs a codeword, it is guaranteed to be the ML codeword. Iterative message-passing 
algorithms do not share this property. On the other hand, iterative algorithms have in some cases the 
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advantage of lower decoding complexity, as compared to LP decoding. However, for LDPC codes this 
advantage is all but eliminated (see |[T4l . ifTSl ). 

Compared to iterative decoding, there has so far been less research on LP-based decoding. While the 
first analytic result for LP decoding |[T3l has been for the case of RA codes, most of the results for LP 
decoding thereafter refer to LDPC codes. In |[T3l . regular RA(2) codes were examined, based on flow 
theory and graph-theoretical arguments. Halabi and Even ||6l have proposed a better bound on RA(2) 
codes which is based on a more careful examination of the underlying graph-theoretical nature of the 
problem. Irregular RA code ensembles have been shown to achieve excellent performance under iterative 
message-passing decoding. For example, in the BEC there are known capacity-achieving sequences of 
codes (see e.g., Q). Motivated by these results for iterative decoding, we examine regular and irregular 
RA codes under LP decoding. We show how to extend the results of |[T3l to regular RA((/) codes for 
even q and to irregular RA ensembles where all repetition degrees are even. The essential novelty in this 
work is the application of Euler's (graph-theoretic) theorem to an appropriately defined (hyperpromenade) 
graph. 

The remainder of the paper is organized as follows. Preliminary material is given in Section JI] 
Section Hn] contains the derivation of our error bound for regular RA codes. A discussion of these 
results as well as their extension to irregular RA codes appears in Section |lVl Section |V] concludes the 
paper. 

II. Preliminaries 

In this section, we give our nomenclature and some necessary preliminary material. Our notations 
largely follow those of Feldman [H. In the rest of the paper, we will deal exclusively with repeat- 
accumulate (RA) codes. These codes were proposed by Divsalar et al. in ||3l, in which regular code 
ensembles were defined, and later generalized to irregular ensembles in Repeat-accumulate codes 
feature a simple encoder structure and are known to have good decoding performance under iterative 
message-passing decoding. The encoder of a regular RA{q) code, shown in Fig. [T] takes an input block 
of k bits, applies a g-fold repetition code to obtain a block of n = qk bits, interleaves the block and 
finally feeds it into a rate- 1 accumulator. The accumulator is a recursive convolutional encoder with one 
memory element which outputs at time index t simply the mod-2 sum of the inputs up to time t. The 
code rate in this case is R = |. In an irregular code, the number of times a bit is repeated (or its 
repetition degree) is not constant. The fraction of bits which are repeated a certain number of times by 
the encoder is known as the degree distribution, and it is usually expressed either in vector form or in 
polynomial form. 

Our analysis will focus on transmission of regular or iiTcgular RA codes over memoryless, binary- 
input, output-symmetric (MBIOS) channels. We denote by G {0, 1}, z = 1, . . . , the i'th bit of the 
codeword to be transmitted and by Ui = y{xi) the channel modulation of the i'th bit. Since the channel is 
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Fig. 1. Block diagram of a regular RA encoder. 



memoryless, the i'th received symbol depends only on yi by the conditional probability law P{yi\yi) 
imposed by the channel. 

The log-likelihood ratio (LLR) 7^ is defined as 



Example 1: In the binary symmetric channel (BSC) the channel input alphabet is binary, and so we 
have yi = x,. The log-likelihood ratio is 7^ = In (^^^^ if ?7j = and 7^ = In (^j^^ if 27i = 1- 

Example 2: Consider the binary-input additive white gaussian noise (AWGN) channel. Following 
conventional notation, we map bit to +1 and bit 1 to —1, i.e., we have y^ = 1 — 2xj. In the AWGN 
channel we have 

yi = yi + Zi 

where Zi is a normally-distributed random variable, Zi ~ J\f{0,a^). The LLR in the AWGN channel is 
easily shown to be 7^ = 

It is convenient for purposes of analysis to rescale the LLR. In the BSC, the rescaling enables to have 
7i = 1 if yi = and 7^ = — 1 if = 1. In the AWGN channel, rescaling allows us to express 7i = 

A. A Linear Program to Decode Repeat-Accumulate Codes 

We are interested in the performance of a linear-programming (LP) decoder for RA codes. To make 
our presentation self-contained, we briefly present the linear program proposed by Feldman [I]. 

First, we look at the accumulator section of the encoder, assuming at this stage that it were the entire 
encoder. The accumulator is a rate- 1 convolutional encoder, and has a state diagram and trellis as shown 
in Fig. |2] The trellis T features connections or edges describing transitions between states from successive 
time intervals, which are labeled according to the output of the accumulator. Each edge also has a 'type' 
which depends upon the input bit triggering the transition. Note that the trellis contains an extra layer 
used to terminate the code. Adding the extra bit to force the encoder back to the zero-state incurs a 
small loss in the code rate, but makes analysis more convenient, since each codeword corresponds to 
a "cycle" rather than an arbitrary path in T. All edges have a direction (i.e., forward in time). Also, 
every edge e € T is assigned a cost 7e = 7i where the index i is selected according to the trellis 
segment containing the edge. Consequently, it can be shown that finding the ML codeword is equivalent 
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(a) State diagram of an accumulator 




(b) The trellis for an accumulator 



Fig. 2. State diagram and trellis of an accumulator. Each state transition has a label according to the output of the accumulator. 
Solid lines denote state transitions when the input bit is zero, and dashed lines are used when the input bit is one. 

to finding the minimum-cost path traversing across the trellis. If indeed the accumulator were the entire 
code, there would be no additional constraints on the path, and finding the one with mininal cost could 
be accomplished, for example, by using the Viterbi algorithm. 

We now take the effect of the (possibly irregular) repetition code and interleaver into account. Assume 
that input bit xt has repetition degree qt- If so, we would expect the inputs into the accumulator at 
some set of indices = {i^, i^, . . . , z'^'*} to be identical (obviously, this set depends on the interleaver). 
Translating this into trellis terms, we require that for all t = 1, . . . , A; we have the same type of edge at 
all layers i ^ Xt. Any path satisfying this requirement is called an agreeable path. 

A linear program to decode RA codes (RALP) was defined by Feldman HI as follows: 

RALP: minimize ^^7e/e s.t. 

eer 

f- = ^ (2) 

E E VsGr\{.[|,sO} (3) 

e£out{s) e€m(s) 

Xt = Yfe \/iGXt, t = l,...,k (4) 

< /e < 1 Ve G T 

where m(s) is the set of edges entering node s, out{s) is the set of edges exiting node s, and /j = 
{{s^-ijsj), {sj_i,s^)} is the pair of "input-1" edges entering layer i. Equation dlj) ensures that one unit 
of flow is sent across the trellis. Equation ^ enforces flow conservation at each node, i.e., that whatever 
flow enters must also exit. The agreeability constraints are imposed by equation (01). These constraints 
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say that a feasible flow must have, for all Xj, t = 1, . . . ,k, the same amount xt of total flow on input- 1 
edges at every segment i ^ Xt. 

In order to use RALP as a decoder, one should solve the LP problem above on the trellis with edge 
costs 7e defined by the received vector y, thus obtaining an optimum point {f*,x*). If /* is integral 
(i.e., all values are or 1), x* is output as the decoded information word. If not, the output is "error". 
We refer to this algorithm as the RALP decoder. It can be shown that this decoder has the ML certificate 
property: whenever it finds a codeword, it is guaranteed to be the ML codeword. 

III. An Error Bound for Regular RA{q) Codes with Even q 

In this section, we derive an upper bound on the decoding error probability of the RALP decoder. For 
simplicity, we deal in this section exclusively with regular codes. This is an extension of the results of 
im, which applied to RA(2) codes, to the case of RA(g) codes for even q. For the purpose of analysis, 
we define an auxiliary graph which contains subgraphs called hyperpromenades which carry a meaning 
similar to error events in convolutional codes. The structure of these hyperpromenades suggests a design 
of an interleaver. We show how to design a suitable interleaver, and show that the RALP decoder has 
an inverse-polynomial error rate (in the blocklength n) when this interleaver is used. Our discussion will 
not depend initially on the repetition degrees being even; we will only require this assumption later on. 

Let be a weighted undirected graph with n vertices {gi, ■ ■ ■ , Qn) connected in a line. We call these 
edges Hamiltonian, as they form a Hamiltonian path along the graph. We associate a cost (weight) 
c[gi,gi+i] with each Hamiltonian edge {gi,gi^i), equal to the cost added by decoding code bit i to the 
opposite value of the transmitted codeword. Formally, we have 

c[5i,5i+i] = 7i (1 - 2xi) (5) 

where Xi is the i'th codeword bit, and 7^ is the log-likelihood ratio of code bit i, as defined in ([T]). In 
the BSC, we have c[gi,gi+i] = +1 if Xi = yi = yi and c[gi,gi+i] = -1 if / y^. Naturally, the 
decoder does not know the costs c[gi,gij^i]; they are used solely as a means for analysis. In addition to 
the Hamiltonian edges described above, contains also hyperedges connected between the vertices. A 
Q-hyperedge is an edge connecting q vertices, and is formally defined as an unordered g'-tuple of vertices 
from the graph. We connect a total of k hyperedges, where hyperedge t contains the vertices within the 
index set (t = 1, . . . , A;). Note that according to this setting, exactly one hyperedge is connected to 
every vertex. In [H, where the authors consider the case q = 2, these extra edges form a matching on 
the vertices of the graph. Extending this nomenclature to any q, we will call them matching hyperedges. 
These edges are defined to have zero cost in the auxiliary graph. 

An atom path ij,{a, r) is a walk which begins at vertex g^ and finishes at vertex g^, using Hamiltonian 
edges only. Therefore, if o" < r, we have /^(o", r) = (51^, 5cr+ii • • • 1 S'r-i, 5t)- A hyperpromenade ^ is 
a set of atom paths, possibly with multiple copies of the same atom path in the set. The set ^' is also 
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required to satisfy a certain "agreeability" constraint. Formally, define, for each segment i in the trellis 
where 1 < i < n, the following multiset Bi'. 

-Bj = {/U G : /.f = /.f((7, r), where i = a or i = t} 

Note that if multiple copies of some fi{a, r) exist in ^, then Bi contains multiple copies as well. We 
say that ^ is a hyperpromenade if, for all t = 1 . . . , A;, where Xt = {t^, t^, . . . , f^}, we have 

\Bt^\ = iBt^l = ■ ■ ■ = \Bti\ (6) 



Example 3: As an example, consider the auxiliary graph illustrated in Figure |3] In this auxiliary graph 
of an RA(4) code, the multiset 

^ = {^^(1, 2), ^(1, 2), ^(3, 10), ^(4, 5), /i(4, 12), ^(5, 7), ^^(6, 11), /i(7, 12), ^(8, 9), ^(8, 9)} 

is a hyperpromenade. 




gi 92 93 94. 9h 96 97 9s 99 9w 9u 9i2 



Fig. 3. The auxiliary grapli in example [5] 

The cost of every atom path fi{a, r) is equal to the sum of the costs of its edges. The cost of a 
hyperpromenade is equal to the sum of the costs of the atom paths it contains, including repeated ones. 

We have the following theorem ([1, Theorem 6.13]). 

Theorem 1: For any regular RA((7) code, the RALP decoder succeeds if all hyperpromenades have 
positive cost. The RALP decoder fails if there is a hyperpromenade with negative cost. 

This theorem was stated in [1], and was therein proved for the case of g = 2. The same proof applies 
also for q > 2. However, in order to use this result we need to show how to construct graphs B which 
yield good interleavers for RA{q) codes; we will show that these graphs have a small probability of 
having a negative-cost hyperpromenade. A key metric in our analysis will be the girth of the auxiliary 
graph. As the auxiliary graph contains hyperedges as well as regular edges (thus it is a hypergraph), 
the notion of girth needs to be extended. Define a path p = {po, . . . ,pk) in the auxiliary graph to be a 
series of vertices where every two consecutive vertices are connected by an edge or a hyperedge. The 
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only exception is that the same (hyper)edge may not be traveled two times in a row; this means that 
U-turns are not allowed, and also roundabouts within a hyperedge (e.g., if 11,12,13 G Xt for some t, then 
{9ii ~^ 9i2 ~^ 5*3) is i^ot ^ valid path). Aside from this restriction, a path may repeat vertices, edges and 
hyperedges. Path length is measured in edges, so the path p = {po, . . . ,pk) has length k. A cycle is a 
path that begins and ends in the same vertex. The girth of a hypergraph is thus the length of its shortest 
cycle. We further define a simple path (resp. simple cycle) to be a path (cycle) which does not repeat 
Hamiltonian edges but may repeat hyperedges. 

Our first step is to show that an auxiliary graph G with high girth can be constructed, thus implying 
the existence of appropriate interleavers. For the case of g = 2, Erdos and Sachs ifTTl (see also 0) 
have shown a construction for such an interleaver. The following result is an extension to g > 3 using 
a similar technique. While our subsequent error bound is valid only for even q, this restriction need not 
be imposed yet. 

Theorem 2: Let n = qk be the block length of a regular RA(g) code with q > 3 and n > g^. Then one 
may construct for this code an auxiliary graph which is a Hamiltonian line plus k g-hyperedges which 
form a matching, so that the auxiliary graph has girth no less than g = [logg nj — 1. 

Proof: See appendix lAl 

Denote the interleaver produced by this approach by vr^;. The next step is to study the auxiliary graph 
of an RA code which uses vr^; as an interleaver. We focus on the underlying nature of hyperpromenades 
in this graph. 

A study of the structure of hyperpromenades. First, we point out that in the case of g = 2, it 
was shown by Feldman HI that every hyperpromenade is equivalent to a cycle in gQ. This observation 
simplifies the analysis, as one must deal solely with simple cycles in the auxiliary graph. In our case 
where q > 2, this is not necessarily true. Therefore, our conclusions must be based only on the definition 
of a hyperpromenade. 

Our goal will be to provide an upper bound on the probability that the auxiliary graph contains a 
negative-cost hyperpromenade. Let ^ be any hyperpromenade in 0. We construct a graph called the 
hyperpromenade graph as follows: 

1) For every atom path ii{(t,t) G ^, draw in 0$ two vertices, labeled a and r, according to the 
endpoints of the atom path. Connect the two vertices by an edge. If ;u(cr, r) appears more than 
once in 4*, we will have multiple replicas of this structure, accordingly. 

2) Merge all vertices with the same label into one vertex. At this stage, the graph may no longer be 
simple, i.e., there may be vertex pairs connected by more than one edge. 

3) Add the matching hyperedges to the graph. 

'in fact, this was the original definition of a promenade, to which the hyperpromenade reduces for the case g = 2. 
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By this construction, it is obvious that one can reconstruct any hyperpromenade given its hyperpromenade 
graph, i.e., there is a 1 — 1 relation between hyperpromenades and their graphs. We further assign a cost 
to every edge {a, r) in as follows : c[{a, r)] = c[fi{a, r)] where c[fi{a, r)] is the cost of the atom 
path in 0. Hyperedges in 0^ are assigned zero cost. With this definition, the total cost of the edges 
in 0,1, is the same as the cost of the hyperpromenade. We note that the hyperpromenade graph may or 
may not be connected (in the sense that there is a path between any two of its vertices). If it is, we call 
the hyperpromenade connected. This property is different from the connectedness of the auxiliary graph, 
since now vertices common to different atom paths are ignored unless they are at the endpoints. As an 
example, we draw the hyperpromenade graph of ^ from Example |3] in Figure H] In this example, the 
hyperpromenade is not connected and has two connected components. 

Let be a hyperpromenade which is not connected, and consider the corresponding graph, 0^. It 
is easy to verify that each of its connected components is the hyperpromenade graph of a valid hyper- 
promenade. Therefore, ^ can be partitioned into disjoint connected hyperpromenades ^i, 4*2, ... , 'I'm 
satisfying c[^] = c[^i] + • • • + c[^j\/]. If ^ is a negative-cost hyperpromenade, it thus must have a 
component with negative cost. Therefore, by Theorem [H the probability that the RALP decoder fails is 



3 10 
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Fig. 4. The hyperpromenade graph 0* of the hyperpromenade from Example [S] 



the same as the probability of having a connected hyperpromenade with negative cost. 

The next step is to establish that it is enough to look at simple paths and cycles which are contained 
in a connected negative-cost hyperpromenade. In the following, we assume n = g^'^^ for some integer 
/ to avoid floors and ceilings, although our arguments do not rely on this. 

Theorem 3: Let be the auxiliary graph of a regular RA(g) code with q even. Assume has girth at 
least g, where g = log^ n — 1 (by Theorem |2j this girth is attainable). If there exists a hyperpromenade 
in with c[^] < 0, then there exists a simple path or cycle y in that contains | Hamiltonian edges, 
and has cost c[Y] < 0. 
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Proof: Let ^' be a hyperpromenade with c\^] < 0. By the discussion above, we may assume w.l.o.g. 
that ^' is connected. First, we will show that there is a cycle H = {fiQ, hi, ... , h^^^ = Kq) in Q which has 
c[H] = c[^], where c[H] is measured along the edges of H. Draw the hyperpromenade graph and 
contract the matching hyperedgej^ The result is a graph, with no hyperedges, where vertex a has degree 
Since q is even, all degrees are even and we can find an Eulerian tour C in i.e., a simple 
cycle which passes through all the edges. Since every edge in 0.1, has the same cost as its corresponding 
atom path in ^, we have c[C] = c[^]. By adding back the matching hyperedges and tracing along the 
atom paths making up ^, we get from C the desired cycle ^ in with c[H] = c\^]. Now, contract the 
matching hyperedges in H. Denote by H = {ho, hi, ... , h\u\ = ho) the contracted version of H. 

No two matching hyperedges share an endpoint, and by definition the same hyperedge cannot be used 
twice in a row. Therefore, at most every other edge of is a matching hyperedge. Thus, and since H 
is a cycle, 

\H\ > -\H\ > -g 
I I - 2i 1-2^ 

Write out the cost of H explicitly as 

\H\-1 

c[H] = E c[hi,hi+i] 

i=0 

Let Hi = {hi, . . . , be a subsequence of H containing g/2 edges, and let 

c[Hi]= c[hj,hj+i] 
j=i 

Hi must be a simple path (or a simple cycle), i.e., it can have no repeated Hamiltonian edges; otherwise, 
by adding the matching hyperedges back into H, this would imply the existence of a cycle in of length 
less than g. Note that 

c[H] = c[H] 

* ~" ' 1=0 

since every edge is counted exactly ^g times. Now, if c[H] = c[^] < 0, then there must be a simple 
path or cycle ffj. such that c[ffj.] < 0. Adding back the matching hyperedges, we get the desired simple 
path or cycle Y. ■ 

Theorem |3] asserts that if there are no negative-cost simple paths or cycles, then there is no 
corresponding negative-cost hyperpromenade. It is also the first point in our derivation which requires to 
use the assumption that q is even. We will now use this to get an error bound for LP decoding under the 
BSC. This bound extends HJ Theorem 6.5]. 

Theorem 4: Consider a regular RA(g) code {q even) with block length n, and tte constructed in the 
proof of Theorem |2] as an interleaver. Assume that the code is transmitted over the BSC. Let e > 

^contracting the hyperedge unites all vertices it connects into one vertex, retaining any other edges connected to the original 
vertices. 
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be some positive number. If the transition probability p satisfies p < ^"^("^+^+5 '°s,(4g-2))^ ^.j^^^^ when 
decoded using the RALP decoder, the code has word error probabiUty 

WEP < K{logg n) ■ n"' (7) 

where K is a positive constant. 

Proof: By theorems [T] and [3l the decoder will succeed if all simple paths or cycles in G with 
= ^(loggTi — 1) (this equality is attained by definition of tte) Hamiltonian edges have positive cost. 
We claim there are at most n{2q — l)^^ simple paths and cycles with Hamiltonian edges. To see this, 
build a simple path or cycle by choosing any vertex gi^ and traversing a simple path beginning with a 
Hamiltonian edge. There are at most two choices for the first edge. If, after traversing the Hamiltonian 
edge, we arrive at a vertex gi^, then from gi^ we can choose to proceed along the second Hamiltonian 
edge connected to it, or traverse a hyperedge. If a hyperedge is traversed, there are at most 2{q — 1) 
possible choices for the next Hamiltonian edge. This gives a total of 2g — 1 choices for the second 
Hamiltonian edge. Proceeding in this manner, we see that there are no more than {2q— l)^^ simple paths 
or cycles with ^g Hamiltonian edges beginning from the vertex gi^. Choosing an arbitrary starting vertex 
gives a total of no more than n{2q — 1) 2^ possible simple paths or cycles. In the BSC, each Hamiltonian 
edge has cost —1 or 1. Therefore, in any simple path or cycle Y, at least half of the edges must have 
cost —1 in order to have c[Y] < 0. Consequently, we have 



Pr (c[y] <o)=J2 -p)^^"'- < jalj^^p'' (8) 



WEP < n{2q-l)^'^^g(^l^^p-4 



k=i9 

Applying the union bound over all possible choices of Y (i.e., paths with ^g = ^ (log^ n — 1) Hamiltonian 
edges), we have 

< ^(loggn) •n^+2l°§«(2g-l)^ilog,2^ilog,p.^~i 

= K(Iogg n) ■ 

where K = ^p ■ 

A similar result for the AWGN channel, which extends HI Theorem 6.6] follows. 

Theorem 5: Consider a regular RA(g) code (q even) with block length n, and tte constructed in the 
proof of Theorem [2] as an interleaver. Assume that the code is transmitted over the binary-input AWGN. 
Let e > be some positive number. If the noise variance satisfies ^ >41ng(l + e+ ^ Iogg(2g — 1)), 
then the code has, when decoded using the RALP decoder, word error probability 



WEP <kJ n-^ (9) 
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where K is a. positive constant. 

Proof: In the AWGN channel, each Hamiltonian edge in the auxiliary graph has cost 

c[9i,9i+i] = 7i (1 - 2a;i) = JiVi = {Vi + Zi)yi = 1 + 

where 'Zi ^ M (O, cr^). Therefore, if Y is any simple path or cycle with = \{\ogq n—1) Hamiltonian 
edges, we have c\Y] = | + Z, where 

^~AA(0,|a2) 

For a random variable X ~ AA(0, s^), we have the following inequality: for all x > 0, 

Pr {X>x)< -^=e-^ (10) 
xv2vr 

Using ([Tol l we get 



Pr(c[y] < 0) = Pr(Z > |) 



< \\—e-^ (11) 
-Kg 



As in Theorem IH using the union bound over all possible choices of Y gives 

WEP < n(2g - 1)^3 • Pr (c[y] <0) 



■Kg 



7r(logg n - 1) 



V log„ n-l 



where K = \l —. 



A similar result for general MBIOS channels is given here in Theorem [6] The proof follows the lines 
of the proofs of theorems |4] and [51 and is omitted. 

Theorem 6: Consider a regular RA(g) code {q even) with block length n, and tte constructed in the 
proof of Theorem |2] as an interleaver. When transmitted over an MBIOS channel and decoded using the 
RALP decoder, the code has word error probability WEP satisfying 

/i{log,n-l) \ 

WEP < n(2g - l)2(i°g, "-i) . Pr ^ < (12) 

where Zj, i = 1, . . . , ^(logg n—1) are i.i.d. random variables with cumulative distribution function 

Pr {zi < z) = Pr (7j < z\xi = 0) 



We note that The BSC and AWGN error bounds in (|7]l and ^ can be derived as special cases of (fT2ll . 
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TABLE I 

BSC TRANSITION PROBABILITY THRESHOLDS ENSURING VANISHING ERROR PROBABILITY, AS DERIVED FROM THEOREmE) 



IV. Discussion and Numerical Results 

In the last section, we have given explicit bounds on the decoding error probability for the RALP 
decoder. While these bounds apply to regular RA(g) codes with even q, it is possible to extend the error 
bounds to irregular RA codes, where all repetition degrees are even. This is apparent if we note that the 
proofs of Theorems [T] and [3]-[6] do not make any assumption on the regularity of the code. The fact that 
all repetition degrees must be even is required in the proof of Theorem [3] (this, both for the regular and 
irregular cases). However, we are unable to provide an extension of Theorem |2] to irregular codes. In 
other words, the construction of an interleaver which yields an auxiliary graph with girth g = log^ n — 1 
does not easily extend to the irregular case. Still, whatever girth may be achieved for a specific auxiliary 
graph of an irregular RA code can be used to apply Theorem [6] to any MBIOS channel. That is, if a girth 
g' can be achieved for an irregular graph (rather than g = log^ n — 1 when tte is used as an interleaver), 
we would have that 

WEP < n(2g,„,, - 1)59' . pr z^ < oj (13) 

instead of (fT2l ). where q^a^ now denotes the maximum repetition degree; the quantity n(2gmax — 1)^^ 
is a revised bound on the number of possible simple paths and cycles with ^g' Hamiltonian edges in 
the irregular graph. It particularizes to the expression in the proof of Theorem |4] in the special case of a 
regular code. 

In Table U we give the thresholds for the BSC transition probability for which the error bound d?]) 
decays to zero, for some choices of q. It is apparent that the threshold worsens as q increases. This 
is in contrast to our expectation that coding performance should improve with the reduction of coding 
rate. Obviously, one cause for this is that having a negative cost path or cycle with Hamiltonian 
edges is only a necessary condition for decoding failure. It would be reasonable to conjecture that, as 
q increases, many structural restrictions other than this condition must exist in order to have a negative- 
cost hyperpromenade. Also, the reliance on a union bound over all possible simple paths and cycles 
undermines the tightness of the bound. Furthermore, if we were to examine an irregular RA code, it can 
be seen that the bound in ([T3] ) would yield the same result for the irregular code as well as for a regular 
code with repetition degree qmax- Thus the possible improvement (increase) in the coding rate obtained 
by reducing the repetition degrees of some information symbols is not reflected in our bound. 

We further note that the improvement over Feldman's work presented in ||6l for g = 2 does not seem 
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to extend to g > 2. This is due to the following. In ||6l, an improvement for g = 2 is obtained by a 
careful characterization of cycles in the auxiliary graph; since for (7 = 2 every cycle is a promenade, 
(thus finding a cycle is a sufficient condition for identifying a promenade) the study in fSJ successfully 
captures all error events necessary for an upper bound. The distinction between the q = 2 case and q > 2 
is that in the latter case, not every cycle is a valid hyperpromenade. Consequently, analysis of cycles 
alone is insufficient in order to obtain an upper bound on the decoding error probability for q > 2. 

V. Summary 

We have presented an upper bound on the word error probability of regular and irregular RA codes 
transmitted over MBIOS channels and decoded by the RALP decoder. This bounding technique extends 
the one presented by Feldman |[T1 for regular RA(2) codes to the case of regular RA(g) codes and to 
irregular RA codes with even repetition degrees. Our technique essentially reUes on applying Ruler's 
graph-theoretic theorem to an appropriately-defined graph (i.e., the hyperpromenade graph). 

Appendix A 
Proof of Theorem [2] 

Theorem |2l Let n = qk he. the block length of a regular RA(g) code, g > 3 and n > q^. Then one 
may construct for this code an auxiliary graph which is a Hamiltonian line plus k g-hyperedges which 
form a matching, so that the auxiliary graph has girth no less than g = [logg nj — 1. 

In the proof we will assume n is a power of q to avoid using floor and ceiling notations. The proof 
easily extends to the general case. 

Proof: Let H he. n Hamiltonian cycle with n = vertices. Let Eq he the set of edges in H, and 
V the set of vertices (this is denoted hy H = [V, Eq)). Let D denote the set of all possible g-hyperedges, 
and let ^ C D satisfy the following conditions 

1) No vertex is incident with more than one g-hyperedge in A. 

2) The girth of Ha = (V, Eq {j A) is not less than g. 

Then we shall show that ii\A\ < q9, there exists ^+ C D such that |^+| = |^| + 1 and satisfies [B 
and|2l). By repeatedly applying this result we obtain that there exists some set A with \A\ = q^ satisfying 
the above two conditions. 

Let dA he the distance function in Ha- Let V2{A) C V denote the set of vertices with degree 2 in Ha, 
i.e., those which are not incident with any g'-hyperedge in A. Given that \A\ < q^, it follows that V2{A) 
has at least q members. If some set of q vertices pi,p2, ■ ■ ■ ,Pq G V2{A) is such that dA{pi,Pj) > g — 1 
for all z, j G {1, . . . , q}, i 7^ j, then the set = {piP2 ■ ■ - Pq] satisfies the required conditions. 

Suppose there is no such set of q vertices. Define t to be the maximum number of vertices pi, . . . ,pt € 
V2{A) such that dA{pi,Pj) > g — 1 for all i,j G {!,...,*}, i / j. By our assumption we have that 



14 



i ^ t < q — 1. Select vertices pi, . . . ,pt which achieve this maximum. Let 

Driz) = {veV\dAiz,v)<r} (14) 

We claim that 

V2{A) C [/' 4 D,^i{pi) U Dg^i{p2) U . . . Dg^iipt) (15) 

This is easily seen, as follows. Suppose there is a vertex pt+i G V2{A)\U'. Then the set pi, . . . ,pt,pt+i 
is such that dA{pi,Pj) > g — I for alH, j G {1, . . . , t + 1}; this is in contradiction with the definition of 
t. 

Set pi, . . . ,pt according to the definition above, and choose pt+i, ■ ■ ■ ,Pq ^ V2{A) arbitrarily. For any 
X G V2{A) the set Dg-i{x) has size at most 

l + 2 + 2g + 2g2 + ... + 2g3-2 = 1 + 2^^ ^ (16) 

q - 1 

Consequently, \f U = Dg^i{pi) [j Dg^i{p2) U ' ' ' U Dg-i{pq), then 

\U\ < \Dg.i{pi)\ + |Dg-i(p2)| + • • • + \Dg-i{pq)\ < q + 2g^^^^ (IV) 

Let W = V\U. Since \V\ = q^~^^, it follows from the preceding inequality that 

\W\ > q3+^ - q - 2q'^^—^ (18) 
q-1 

Let pi,P2, . . . ,Pq G W he arbitrary vertices, and let U = Dg^i{pi) |J Dg^i{p2) U " " U ^g-iiPq)- This 
situation is depicted in Figure |5] 

We have that Z?g_i(pi) has size at most 

l + {l + q) + {l + q)q+{l + q)q^ + ••• + (! + q)q^''^ = ! + (! + q)'t zA (19) 

q-l 

and therefore \U\ satisfies 

09-l _ 1 (a) 

U <q + q{l + Q V < \W\ (20) 

where (a) stems from plugging in the expression from ([TS] ) and applying some algebra §. Now, (|20l ) 
implies that there exist vertices si, S2, ■ ■ ■ ,Sg G W such that any pair i j has dA{si, Sj) > g — I. 
To see this, note that one may select si, S2, ■ ■ ■ , Sg G W sequentially, as follows: first select si G TV 
arbitrarily, then select for i = 2, . . . , g 

i 

s,eW\\jDg^i{sj) (21) 
j=i 

arbitrarily; (l20l ) ensures that the set in the RHS of (|2TI ) is nonempty. We further have by definition of 
that none of the vertices si,S2, ■ ■ ■ ,Sq are in V2{A). Therefore, for every Si,i = I, . . . ,q there is a distinct 
hyperedge SjS.^-* . . . sf'\ Consider these g-hyperedges, sis^^^ . . . s^f\ 828^2^ . . . Sg"^-*, . . . , SqS^^ . . . s\''\ 

^one needs to assume here that q > 3 and g > 3; g > 3 follows from the assumption that n > q*. 
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Since all vertices in W have distance at least g from pi,p2, ■ ■ ■ ,Pq, it follows that Sj , 2 < i < q, 
1 < j < Q all have distance at least g — 1 from pi,p2, ■ ■ ■ ,Pq. Therefore, the set 

A+ = A\j{p^sf\..s['\p2S^^K..4\pqs(^K..s^^^\s^S2...,Sq} 

\ '['^I'^l • • • '^l'^'^) '^252 , SgS^ ^ . . . •S^'^-' j- 

satisfies the required conditions. 

Once we have built a circle of vertices and a g^-fold matching between them, the theorem follows by 
removing one of the edges along the circle; this is the nonexistent edge in the graph between the first 
and last nodes. Removing this edge does not reduce the girth of the auxiliary graph, and completes the 
desired construction. ■ 

Discussion. The proof of the theorem incurs bounding the size of the neighborhood sets Dg^i{-). We 
could have tightened the bounds we used, e.g. in (fT6l ). by noting that if a matching edge is traversed, the 
next level neighbor can be only one of two choices (a Hamiltonian edge must be used next). This would 
have replaced equations ([T6l ) and ([T9l ) with more elaborate, albeit precise expressions. Consequently, the 
girth bound would have improved by a constant factor at most. This is of lesser importance than the 
ultimate behavior of the girth of the graph which is logarithmic in the block length. We thus omit this 
refinement. 

Our proof uses a construction to show it is possible to build the desired graph. We note that this 
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construction contains degrees of freedom which can lead to different results. 

The complexity of the proposed construction. We will show that the complexity of constructing the 
matching above is polynomial in n, and in particular that it is no more than 0(n''"'"^) in time and space. 
To show this, we go over the stages of the construction and bound their complexity (some technical 
details are omitted). 

1) The basic iteration step in the construction involves adding a hyperedge to the graph. This step is 
performed k = n/q times. We therefore examine the worst-case complexity of this basic step and 
multiply the result by k. 

2) In each iteration, construct for every vertex x ^ V the set Dg^i{x), in the form of a list. This 
entails a complexity of no more than n ■ = 0{'n?), since is an upper bound on the size of 
Dg_i{x) (see Eq. ([Hi). 

3) Using the lists constructed in step (2), we need to determine if there exists a set of vertices 
Pi,P2, ■ ■ ■ ,Pq G V2{A) such that dA{pi,Pj) > g — 1, i j . It can be seen that the complexity of 
this step is no more than 0{n'^~^^). This bound includes the complexity associated with finding 
the vertices pi,...,pt defined in the construction. If t = q, the iteration step ends. If not, we 
need to construct the set W and find the vertices si, S2, ■ ■ ■ , Sg G W such that any pair i / j has 
dA{si,Sj) > g -1. 

4) The construction of the set W makes use of the precalculated lists of neighbors, and can be seen 
to have complexity no more than 0(n). 

5) The final step requires finding vertices si,S2, ■ ■ ■ ,Sq € W such that any pair i ^ j has dA{si, Sj) > 
g — I. Using the sequential construction described above, the worst-case complexity can be seen to 
be O(n^). Once the vertices are found, the matching hyperedges are added and the iteration step 
ends. 

The total complexity of constructing a high-girth interleaver is thus no more than 

^ (0(n2) + 0(71"+!) + 0(n) + 0(n2)) = 0(n''+2) 
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